Kui Ren

School of Cyber Science and Technology, Zhejiang University

DualBreach: Efficient Dual-Jailbreaking via Target-Driven Initialization and Multi-Target Optimization

Apr 21, 2025, via arXiv

ControlNET: A Firewall for RAG-based LLM System

Apr 13, 2025, via arXiv

Imperceptible but Forgeable: Practical Invisible Watermark Forgery via Diffusion Models

Mar 28, 2025, via arXiv

Towards LLM Guardrails via Sparse Representation Steering

Mar 21, 2025, via arXiv

Harnessing Frequency Spectrum Insights for Image Copyright Protection Against Diffusion Models

Mar 17, 2025, via arXiv

Sparse Autoencoder as a Zero-Shot Classifier for Concept Erasing in Text-to-Image Diffusion Models

Mar 12, 2025, via arXiv

Can Small Language Models Reliably Resist Jailbreak Attacks? A Comprehensive Evaluation

Mar 09, 2025, via arXiv

Towards Collaborative Anti-Money Laundering Among Financial Institutions

Feb 27, 2025, via arXiv

Towards Label-Only Membership Inference Attack against Pre-trained Large Language Models

Feb 26, 2025, via arXiv

REFINE: Inversion-Free Backdoor Defense via Model Reprogramming

Feb 22, 2025, via arXiv